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ABSTRACT 


Data from the chloroplast genes rbcL and ndAF, totaling more than 3500 base pairs of DNA sequence, were used 
to examine the monophyly of the Scrophulariaceae, including several groups that have been suggested to belong to 
or to be derived from the Scrophulariaceae. Thirty-two taxa representing the Lamiales s.l. and outgroups were sampled 
and each of the sets of gene sequences was analyzed separately and in combination. Results indicate that two distinct 
clades composed of elements of the traditional Scrophulariaceae exist and that a monophyletic Scrophulariaceae, even 
one liberally circumscribed to include several small families, cannot be supported by these data. One group, designated 
“scroph I,” includes Verbascum, Celsia, Selago, Scrophularia, Buddleja, and Nicodemia. A second group, "scroph 
IL," includes Antirrhinum, Digitalis, Veronica, and the Plantaginaceae, Callitrichaceae, and Hippuridaceae. Schle- 
gelia and Paulownia, often assigned either to the Scrophulariaceae or Bignoniaceae, do not appear with either family. 





The Scrophulariaceae are a well-known family 
to temperate botanists and one for which a “ge- 
stalt” recognition serves the scientist well. Mem- 
bers of the family are generally recognized by their 
usually herbaceous habit, their typically bilaterally 
symmetric, tubular flowers, and their many-seeded 
capsular fruits. Taxonomists learn the exceptions 
to the **usual," such as the woody Paulownia trees 
and the nearly actinomorphic flowers of Verbas- 
cum, and learn the characters that identify related 
groups: the winged seeds lacking endosperm of the 
Bignoniaceae, the achlorophyllous parasites of the 
Orobanchaceae, the wind-pollinated Plantagina- 
ceae, the reduced ovule number in the Labiatae 
and Verbenaceae, and the inflorescence bracts and 
explosive capsules of the Acanthaceae. However, 
the characters by which we recognize plants as 
scrophs cannot be labeled shared derived traits, or 
synapomorphies, leaving open the possibility that 
the family is not monophyletic. 

The taxonomic history of the Scrophulariaceae 
reflects the problem of describing a natural group. 
Most recent angiosperm classifications (e.g., Cron- 
quist, 1981; Takhtajan, 1987; Thorne, 1992) have 
recognized a suprafamilial group centered around 


the Scrophulariaceae and containing approximate- 
ly 12 (e.g., Scrophulariales of Cronquist, 1981) to 
15 (e.g., Scrophulariinae of Thorne, 1992) fami- 
lies. Some of the more clearly defined families in 
the group are suggested to be connected to the 
Scrophulariaceae by intermediate genera (e.g., 
Nelsonia connecting with the Acanthaceae or Pau- 
lownia and Schlegelia connecting with the Big- 
noniaceae), whereas other families (e.g., Oroban- 
chaceae, Plantaginaceae, Lentibulariaceae) "may 
logically be considered to be specialized derivatives 
of the Scrophulariaceae" (Cronquist, 1981). The 
Scrophulariaceae, as circumscribed in the above- 
mentioned treatments, usually have the largest 
number of species of any family in the group (e.g., 
3000, Thorne, 1992). Most recent treatments of 
the Scrophulariaceae and related families are de- 
rived from the 19th century treatment of Bentham 
& Hooker (Bentham, 1876), who maintained sep- 
arate families for the Lentibulariaceae, Oroban- 
chaceae, Plantaginaceae, and Selaginaceae. Wett- 
stein (1895) included the Selaginaceae as a tribe 
in the Scrophulariaceae. Hallier (1903), in rec- 
ognition of the apparently derived nature of the 
Lentibulariaceae, Orobanchaceae, Plantaginaceae, 
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and Selaginaceae relative to the "core" Scrophu- 
lariaceae, included them in the Scrophulariaceae. 
However, since Hallier (1903), classifications (e.g., 
Cronquist, 1981; Takhtajan, 1987; Thorne, 1992) 
have followed Wettstein (1895) in maintaining as 
distinct families the Lentibulariaceae, Orobancha- 
ceae (except Takhtajan, 1987; Thorne, 1992), and 
Plantaginaceae. Infrafamilial classifications have 
recognized three subfamilies, Pseudosolaneae (pri- 
marily tribe Verbasceae), Antirrhinoideae, and 
Rhinanthoideae (e.g., Wettstein, 1895), or two 
subfamilies with Pseudosolaneae subsumed by An- 
tirrhinoideae (e.g., Thieret, 1967), or, when Oro- 
banchaceae is included, it is maintained as a third 
subfamily (Takhtajan, 1987; Thorne, 1992). 

The placement of several genera putatively re- 
lated to Scrophulariaceae, including a few mono- 
generic families (e.g., Callitrichaceae, Hippurida- 
ceae, Hydrostachyaceae), has been disputed in 
angiosperm classifications. Paulownia, a large 
woody tree similar in habit, as well as flower, in- 
florescence, and leaf morphology, to Catalpa (Big- 
noniaceae), was placed in the Scrophulariaceae by 
Bentham and Hooker (Bentham, 1876) and Wett- 
stein (1895), but was placed in the Bignoniaceae 
by Takhtajan (1980) and Cronquist (1981). How- 
ever, Armstrong (1985) marshaled the evidence in 
favor of its placement in the Scrophulariaceae, and 
subsequent classifications have placed it there (e.g., 
Takhtajan, 1987; Thorne, 1992). Schlegelia and 
the related genera Gibsoniothamnus and Synapsis 
similarly have been bounced back and forth be- 
tween the same two families, while not being con- 
sidered closely related to Paulownia. Bentham and 
Hooker included Schlegelia in the Bignoniaceae, 
and this placement has been followed in the mono- 
graph of the family by Gentry (1980) and in Cron- 
quist (1981) and Takhtajan (1987). Armstrong 
(1985) also considered Schlegelia and related gen- 
era and found that they did not fit with the Big- 
noniaceae (e.g., they have seeds with endosperm) 
and fit better within the Scrophulariaceae, although 
they are distinctive within that family. Thorne 
(1992) followed Armstrong (1985) by including 
the Schlegelieae in the Scrophulariaceae. 

The Selagineae, a group of several genera and 
100-200 species mostly native to South Africa 
and Madagascar, are distinguished by the presence 
of a single ovule in each of the two locules in the 
ovary (Thieret, 1967). This group was maintained 
as a distinct family by Bentham and Hooker (Ben- 
tham, 1876), but included in the Scrophulariaceae 
by Wettstein (1895) and most recent treatments 
(e.g., Takhtajan, 1987; Thorne, 1992). However, 
Cronquist (1981) placed the Selagineae in the Glob- 


ulariaceae. The Globulariaceae are a small group 
of two genera and approximately 30 species 
(Thorne, 1992) that traditionally have been con- 
sidered a distinct family, but also have been de- 
scribed as a tribe within the Scrophulariaceae (Bar- 
ringer, 1993). 

Aquatic angiosperms traditionally have been dif- 
ficult to classify, and several aquatics have been 
associated with the Scrophulariales and related or- 
ders. The Lentibulariaceae are aquatic or semi- 
aquatic and insectivorous, but retain showy insect- 
pollinated flowers that suggest a relationship with 
the Scrophulariaceae and have been classified close 
to that family or included within it (e.g., Hallier, 
1903). However, the Callitrichaceae, Hippurida- 
ceae, and Hydrostachyaceae, each a monogeneric 
family, all have very reduced floral morphology 
associated with either wind pollination (Hippuris, 
Hydrostachys) or water pollination (Callitriche), 
and their taxonomic placement has varied dra- 
matically. Based primarily on embryological char- 
acters (e.g., the presence of unitegmic and tenuin- 
ucellate ovules) and evidence from floral 
development, these taxa have been placed in the 
Asteridae near the Scrophulariales or Lamiales (ev- 
idence reviewed in Wagenitz, 1992). Cronquist 
(1981) placed the three families together in the 
order Callitrichales, but this treatment has not been 
accepted widely (Wagenitz, 1992). 

Root parasitism is a well-developed habit in the 
Scrophulariaceae, and the existence of green (pho- 
tosynthetic) hemiparasites, such as Orthocarpus, 
Pedicularis, and Striga, is presumed to indicate 
a link to the achlorophyllous holoparasites of the 
Orobanchaceae. The dramatically reduced chlo- 
roplast genome in the Orobanchaceae (dePamphilis 
& Palmer, 1990), involving the loss of many genes 
involved with photosynthesis and other functions 
(Wolfe et al., 1992), has precluded the inclusion 
of representatives of the group in chloroplast DNA 
(cpDNA) studies involving either restriction site 
mapping of the entire genome or sequencing of 
rbcL. However, evidence from the cpDNA inverted 
repeat region (C. dePamphilis, pers. comm.) and 
DNA sequencing of the nuclear 18S rDNA (A. 
Colwell, pers. comm.) provide support linking the 
Orobanchaceae to the hemiparasites of the Scroph- 
ulariaceae. 

A picture of phylogenetic relationships in the 
Asteridae has begun to emerge from studies of 
cpDNA, in which a monophyletic group, including 
the Scrophulariaceae and related families and pro- 
visionally designated the Lamiales s.l., has been 
identified (Downie & Palmer, 1992; Olmstead et 
al., 1992, 1993a). Initial studies of the chloroplast 
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gene rbcL were aimed at the identification and 
circumscription of major lineages within the As- 
teridae (Bremer et al., 1994; Olmstead et al., 1992, 
1993a). Consequently, sampling of many families 
was inadequate to test hypotheses of monophyly 
at the family level. For previous cp DNA sequencing 
studies within the Lamiales s.l., sampling in the 
Scrophulariaceae has been limited to Antirrhinum 
and Digitalis (Olmstead et al., 1993a). Of the 
disputed genera and closely related families dis- 
cussed above, only the Lentibulariaceae (Chase et 
al., 1993; Olmstead et al., 1993a) and Callitri- 
chaceae (Olmstead et al., 1992, 1993a) have been 
sampled previously. Based on these analyses, the 
Lentibulariaceae were not closely related to the 
Scrophulariaceae, whereas Callitriche, usually 
placed more closely to the Labiatae (e.g., Wagenitz, 
1992), forms a monophyletic group with Antir- 
rhinum and Digitalis (Olmstead et al., 1993a). 

In addition to the broad studies that have iden- 
tified the Lamiales s.l. clade, other studies have 
addressed particular families within the Lamiales 
s.l. using cpDNA sequence data. Scotland et al. 
(1995), using rbcL and ndhF sequences, have 
identified a monophyletic Acanthaceae, which in- 
cludes the often segregated Thunbergiaceae and 
Nelsoniaceae. Wagstaff & Olmstead (unpublished), 
using rbcL sequences, have confirmed a new di- 
vision between the Labiatae and Verbenaceae, fol- 
lowing Cantino’s (1992) morphological cladistic 
study of the Labiatae. A monophyletic Labiatae 
has been established (Cantino et al., 1992), which 
includes much of the former Verbenaceae, leaving 
a reduced and apparently monophyletic Verben- 
aceae comprising only the former subfamily Ver- 
benoideae. Also, a monophyletic Bignoniaceae has 
been circumscribed on the basis of rbcL and ndAF 
sequences (R. Spangler & R. Olmstead, unpub- 
lished), and similar work is under way on the Ges- 
neriaceae (J. Smith, pers. comm.). 

The goals of this study are to explore the pos- 
sibility that the Scrophulariaceae (e.g., sensu 
Thorne, 1992) are not a monophyletic group and 
to examine the placement of several of the dis- 
puted genera and small families putatively allied 
with the Scrophulariaceae. The goal is not to es- 
tablish an infrafamilial classification, because sub- 
stantial doubt exists regarding whether the group 
is monophyletic in the first place. Attempts to dis- 
cover an infrafamilial phylogeny using cpDNA in- 
verted repeat restriction site mapping have met 
with frustration, for example, due to the lack of 
evidence for a monophyletic Scrophulariaceae (C. 
dePamphilis, pers. comm.). However, these efforts 
have discovered many groups of related genera, 


which ultimately may bear recognition at some 
level in a revised classification of the Lamiales s.l. 

Sequencing the chloroplast gene rbcL has prov- 
en to be a useful approach for identifying phylo- 
genetic relationships among higher-order groups in 
the angiosperms (e.g., Annals of the Missouri Bo- 
tanical Garden Vol. 80(3), 1993) and has been 
particularly helpful in identifying major lineages 
within the Asteridae (Olmstead et al., 1993a). How- 
ever, the highly conserved nature of the gene re- 
sults in relatively few informative phylogenetic 
characters at the infrafamilial level. Sequencing 
studies focusing on individual families, with sufh- 
cient sampling to examine infrafamilial relation- 
ships (Conti et al., 1993; Doebley et al., 1990; 
Kim et al., 1992; Olmstead & Sweere, 1994; Soltis 
et al., 1993), have made clear the need to sample 
larger amounts of sequence to acquire sufficient 
numbers of characters to gain resolution and to 
assure greater accuracy in phylogenetic recon- 
structions. For this study, which crosses family lines 
to examine infra- as well as interfamilial relation- 
ships, a longer and more rapidly evolving cpDNA 
gene, ndhF (Olmstead & Sweere, 1994), was cho- 
sen to complement the substantial data set of rbcL 
gene sequences already available for representa- 
tives of the Lamiales s.l. By sequencing both genes 
for all taxa included in this analysis, nearly 3500 
base pairs (bp) of cpDNA are available for phy- 
logenetic inference, relative to the approximately 
1400 bp compared in studies of rbcL alone. 

The gene ndhF in tobacco encodes a protein of 
740 amino acids, which is suggested to be a subunit 
of an NADH dehydrogenase enzyme (undiscovered 
at present) in the chloroplast (Sugiura, 1992). It 
is located in the small single-copy (SSC) region of 
the chloroplast genome close to the junction with 
the inverted repeat (IR). The termination codon of 
ndhF is located 43 bp from the IR/SSC boundary 
in tobacco, and transcription proceeds toward the 
inverted repeat. A comparison of rice and tobacco 
sequences suggests that ndhF has a nucleotide 
substitution rate approximately two times higher 
than that of rbcL (Sugiura, 1989). In combination 
with the fact that ndhF is approximately 50% 
longer than rbcL, the two-fold higher substitution 
suggests a three-fold increase in cladistic charac- 
ters. In a previous study of the Acanthaceae (Scot- 
land et al., 1995), almost exactly three times the 
number of characters were derived from ndhF 
relative to rbcL (421 vs. 136, note that taxon 
sampling was slightly different for the two genes); 
however, in a study of the Solanaceae (Olmstead 
& Sweere, 1994) ndhF provided only 60% more 
characters than rbcL (100 vs. 63). 
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FicuRE 1. Map of ndAF and adjoining regions of cpDNA in tobacco. Boxes indicate reading frames; lines 
connecting boxes indicate noncoding DNA. Vertical bar at left end of orf350 indicates junction between inverted 
repeat (IR) and small single-copy region (SSC). Arrows indicate location of PCR and sequencing primers (primers 
located outside the coding region for ndhF supplied by R. Jansen, University of Texas). Cross-hatching in ndAF 
indicates portion sequenced. Overlapping regions marked above and below the map indicate the two fragments in 


which the gene was amplified. 


MATERIALS AND METHODS 


This study includes 29 representative species of 


the Lamiales s.l. and three outgroup species rep- 
resenting the most closely related orders; species 
names, voucher information, and GenBank acces- 
sion numbers or prior publication references are 
given in Table 1. Eight tribes and both subfamilies 
Antirrhinoideae and Rhinanthoideae are repre- 
sented (following Thieret, 1967), including six spe- 
cies classified as Scrophulariaceae by the most con- 
servative treatments of the family: Antirrhinum 
(subfam. Antirrhinoideae, tribe Antirrhineae), Cel- 
sia (Antirrhinoideae, Verbasceae), Scrophularia 
(Antirrhinoideae, Scrophularieae), Verbascum (An- 
tirrhinoideae, Verbasceae), Digitalis (Rhinanthoi- 
deae, Digitaleae), and Veronica (Rhinanthoideae, 
Veroniceae). Also included are disputed members 
Paulownia (Antirrhinoideae, Paulownieae), Selago 
(Antirrhinoideae, Selagineae), and Schlegelia (An- 
tirrhinoideae, Schlegelieae), and representatives of 
three small putatively related families, Callitricha- 
ceae, Hippuridaceae, and Plantaginaceae. Three 
additional species representing the Lentibulari- 
aceae were included in an analysis of rbcL se- 
quences with the 32 other species (results not 
shown). DNA sequences for the two genes were 
determined from the same plant accession for near- 
ly all taxa. Exceptions include Nicotiana and Bar- 
leria, for which different accessions of the same 
species were used to determine the two gene se- 
quences; Thunbergia, for which two different spe- 
cies were used, T. usamberica for rbcL and T. 
alata for ndhF; and Verbena, for which two dif- 
ferent species were used, V. bonariensis for rbcL 
and V. bracteata for ndhF. Of the 64 sequences 
included in this study, 43 are not previously re- 


ported. It was determined during the course of this 
study that the previously published rbcL sequence 
of Buddleja, and the DNA accession from which 
it was derived, was mistakenly that of Paulownia 
tomentosa (cf. Bremer et al., 1994; Chase et al., 
1993; Olmstead et al., 1993a; Scotland et al., 
1995); the correct sequence for Buddleja is re- 
ported here. 

Procedures for determining DNA sequences were 
as described in Olmstead & Sweere (1994). PCR 
amplification primers for rbcL are those described 
in Olmstead et al. (1992), with internal sequencing 
primers provided by G. Zurawski (DNAX Research 
Institute, Palo Alto, California). PCR amplification 
primers and internal sequencing primers for ndAF 
are described in Olmstead & Sweere (1994), with 
the exception that primers #1 and #2110R (form- 
ing the 5' and 3' ends of the amplified regions, 
respectively) were redesigned to match more ex- 
actly the homologous sequences for the Lamiales 
and were used to determine some sequences (Fig. 
1). Redesigning the primers was made possible by 
sequencing through the ends of the gene in several 
species using primers located in flanking regions 
(primers kindly provided by R. Jansen, University 
of Texas). The new primer sequences are as follows 
(for others, see Olmstead & Sweere, 1994): 41 — 
ATGGAACAGACATATCAATATG(C/G)GTGG 
and #2112R—CCC(C/T)A(C/G)ATATTTGA- 
TACCTTC(G / T)CC. 

A total of 2135 bp of the ndhF gene was am- 
plified in tobacco (somewhat more or less in other 
taxa depending on insertions and deletions) in two 
overlapping fragments of 994 bp and 1333 bp at 
the 5' 


1). Approximately 97 bp at the 3' end of the gene 


and 3' ends of the gene respectively (Fig. 


was not included. Preparation of DNA for sequenc- 
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TABLE 2. Gaps in the sequence alignment for ndhF. Polarity (insertion vs. deletion) is relative to the results of 
the cladistic analysis of the nucleotide data. Coordinates are relative to the sequence for Nicotiana tabacum. 





Insertion/deletion Size (bp) Coordinates' Taxa 
Insertion 6 657/658 Nematanthus 
Insertion 6 1425/1426 Selago 
Insertion/ deletion 9 1443/1444 Nicotiana 
Insertion 3 1443/1444 Streptocarpus 
Deletion 42 1447-1488 Callitriche 
Insertion 6 1467/1468 Streptocarpus 
Insertion 6 1470/1471 Selago 
Insertion/deletion 9 1471-1479 Borago 
Insertion/deletion 9 1474-1482 Gentiana 
Deletion 9 1477-1485 Physostegia 
Insertion 12 1481/1482? Stachytarpheta 
Deletion 3 1486-1488 Digitalis, Plantago, Veronica 
Deletion 6 1489-1494 Veronica 
Deletion 6 1498-1503 Barleria 
Insertion/deletion 9 1498-1506 Gentiana 
Deletion 3 1501-1503 Ajuga 
Deletion 9 1525-1533 Verbena 
Insertion 6 1539/1540 all Lamiales s.l.; deleted in Ajuga, Ni- 
codemia 
Deletion 3 1564-1566 Streptocarpus 
Insertion/ deletion 6 1572/1573 Gentiana 
Insertion 9 1695/1696 Nematanthus, Plantago 
Insertion/ deletion 6 1701/1702 Gentiana, Plantago (insertion) 
Insertion/ deletion 6 1702-1707 Borago 
Deletion 6 1708-1713 Barleria 
Insertion 6 1932/1933 Streptocarpus 
' */" indicates insertion between coordinates in Nicotiana; '*-" range of nucleotides deleted relative to aligned 


Nicotiana sequence. 


* Insertion not in frame with breaks between mRNA codons to enable identification of direct repeat as insertion. 


ing proceeded by first amplifying each fragment 
with balanced primer concentrations to yield dou- 
ble-stranded DNA, then producing single-stranded 
DNA by a second round of PCR using a portion 
of the initial PCR product as template and a single 
primer (Kaltenboeck et al., 1992). Standard di- 
deoxy sequencing was performed using Sequenase 
(U.S. Biochemical) and “P-labeled dATP. Each 
strand was sequenced for all taxa, requiring 12 
sequencing reactions per ndhF sequence (Fig. 1). 

Alignments of the sequences were determined 
by eye and, in the data set used for cladistic anal- 
ysis, gaps in one species relative to any other were 
scored as missing data. The first 26 bp of the coding 
sequence of rbcL correspond to the 5’ PCR primer, 
and no gaps were identified through the first 1428 
bp (the termination codon was beyond bp 1428 in 
all of the sequences), so 1402 bp of rbcL sequence 
were used for the phylogenetic analysis. Several 
gaps representing insertions, duplications, or de- 
letions were observed in the ndhF sequences (Table 


2). The first 23 bp of the coding sequence of ndhF 


correspond to the 5' PCR primer, and the 3’ primer 
corresponds to bp 2110-2135 of the Nicotiana 
tabacum sequence. Inferred lengths of the ndAF 
genes sequenced for this study, including undeter- 
mined sequence downstream of the 3' PCR primer, 
ranged in length from 2196 bp for Callitriche 
heterophylla to 2253 for Nematanthus hirsutus. 
The total aligned length including all gaps was 2167 
bp. 

The data were analyzed in three separate anal- 
yses: rbcL only, ndhF only, and both sequences 
combined. Parsimony analyses were conducted us- 
ing PAUP version 3.1 (Swofford, 1993) with all 
changes weighted equally. For each analysis 100 
random order entry replicate searches were con- 
ducted using the HEURISTIC search option with 
TBR branch swapping and with MULPARS “on” 
to save all equally most parsimonious trees. Results 
of each replicate were monitored to determine if 
multiple islands (Maddison, 1991) were discovered 
(as in Olmstead et al., 1993a). To assess the relative 
support for clades found in the combined analysis, 
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a bootstrap analysis was conducted with 100 rep- 
licates (Felsenstein, 1985). To examine possible 
circumscriptions of taxa that might represent a 
monophyletic Scrophulariaceae, four separate 
analyses were performed with each of the three 
data sets (each gene separately and the combined 
set) with various groups of taxa constrained to 
monophyly. The four constrained circumscriptions 
range from a strict definition of Scrophulariaceae, 
as found in many classifications, to a broad defi- 
nition including many taxa of disputed family mem- 
bership or belonging to small families inferred to 
be closely related to members of the Scrophulari- 
aceae based on cpDNA studies. Following these 
constrained searches, a decay analysis (Donoghue 
et al., 1992) was performed on the combined data 
set, in which all trees were saved up to the length 
of the shortest tree in which a broadly defined 
monophyletic Scrophulariaceae was found. This last 
analysis was performed to determine how much 
structure remains in a tree based on these data if 
a monophyletic Scrophulariaceae is to be accepted. 
The data used in these analyses are available from 
the first author upon request, and sequences have 


been submitted to GenBank (Table 1). 


RESULTS 


A total of 3569 bp of aligned sequence was used 
in this study, including 1402 bp (39%) of rbcL 
and 2167 aligned bp of ndhF. Of the 2167 bp of 
ndhF aligned sequence, 66 bp represent insertions 
or duplications unique to a single taxon or to two 
taxa and therefore are unable to provide any phy- 
logenetic information, leaving 2101 bp of poten- 
tially useful aligned sequence. Of the 3569 total 
bp compared, 2196 bp (61.5%) were invariant, 
640 bp (17.9%) were variable, but uninformative 
with respect to phylogeny, and 734 bp (20.6%) 
were phylogenetically informative. The rbcL se- 
quences contributed 189 informative characters, 
or approximately 26% of the total while accounting 
for 40% of the useful aligned sequence, whereas 
ndhF sequences contributed 545 informative char- 
acters, or 7476 of the total while accounting for 
only 60% of the useful aligned sequence. This 3 : 
l proportion in number of characters derived from 
ndhF relative to rbcL matches that found in the 
Acanthaceae (Scotland et al., 1995) and the pre- 
diction based on size and substitution rate of the 
two genes (Sugiura, 1989). 

The presence of insertions and deletions (indels) 
in ndhF represents a significant difference relative 
to rbcL. In this study, 25 gaps in the sequence 
alignment are required to accommodate indels ob- 
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served in the 32 sequences (Table 2). Of these, 21 
are unique to an individual sequence (based on the 
results of the phylogenetic analysis, seven are in- 
sertions, eight are deletions, and six are among the 
outgroups and therefore unordered). Two gaps each 
apparently represent two independent insertions 
(inferred from the resulting tree based only on 
nucleotide substitutions) that are of the same size 
and in the same position in two unrelated sequenc- 
es. In one case, both insertions represent direct 
repeats of a nine-bp adjacent sequence; in the other 
case, the six-bp insertions are not apparently re- 
lated to adjacent sequences and share no nucleo- 
tides in common. The remaining two indels rep- 
resent apparent synapomorphies. In one, a three- 
bp deletion is shared by Digitalis, Veronica, and 
Plantago, which together form a clade based on 
analysis of both of the gene sequences. In the other, 
a six-bp insertion forms an apparent synapomorphy 
for the entire Lamiales s.l., but gaps of the same 
size occur in the same position as the insertion in 
two ingroup sequences (/Vicodemia, Ajuga) that 
are neither related to each other nor basal in the 
clade. Thus the 25 alignment gaps appear to rep- 
resent 29 insertion or deletion events, of which 
two represent apparent synapomorphies of taxa in 
the current analysis. The phylogenetic significance 
of indels may be limited in this analysis, but their 
potential for phylogenetic inference at lower levels 
is great (see Discussion). 

The parsimony analysis of rbcL sequences yield- 
ed eight equally most parsimonious trees (length 
= 851, consistency index, CI, = 0.426, excluding 
autapomorphies) occurring in two islands of six and 
two trees, respectively. The representatives of the 
Scrophulariaceae form two distinct clades (Figs. 2- 
1), designated “‘scroph I” (containing the type ge- 
nus, Scrophularia, as well as Celsia, Verbascum, 
Selago, Buddleja, and Nicodemia) and *'scroph 
IV (Antirrhinum, Digitalis, Veronica, Plantago, 
Callitriche, and Hippuris) on Figure 4. Schlegelia 
forms a clade with Myoporum in all rbcL trees 
and Paulownia forms a branch sister to the Big- 
noniaceae in the island of two trees or by itself 
near the base of the order in the island of six trees. 
Each island exhibits considerable resolution (trees 
not shown), but the strict consensus of all eight 
trees exhibits much less resolution (Fig. 2). The 
primary difference between the islands is in the 
placement of the Labiatae, which occur as sister 
group to a clade comprising the Acanthaceae and 
"scroph ITI" in island-2 (terminology of Maddison, 
1991) and as a member of a large clade along with 
the Verbenaceae, Gesneriaceae, and “‘scroph I” in 
island-6. These results are congruent with those 
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FiGuRE 2. Results of parsimony analysis of rbcL sequences. On the left is one of six shortest trees in island-6 
(length = 853) with branch lengths indicated. Arrows indicate internodes that collapse in the strict consensus of the 
six trees in island-6. On the right is a strict consensus of the eight shortest trees found, which includes the six trees 
in island-6 and the two trees in island-2. 
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obtained in previous analyses of rbcL sequences 
(e.g., Olmstead et al., 1993a). 

The constrained search with a strict definition 
of Scrophulariaceae monophyly (Antirrhinum, 
Celsia, Digitalis, Scrophularia, Selago, Verbas- 
cum, Veronica) yielded 37 trees 24 steps longer 
than maximum parsimony (MP). A more relaxed 
constraint including Plantago, Callitriche, and 
Hippuris yielded 26 trees 9 steps longer than MP. 
Adding additional taxa, suggested by the maximum 
parsimony results to be potentially part of a mono- 
phyletic Scrophulariaceae clade, gave results as 
follows: (1) adding Buddleja and Nicodemia, 4 
trees—3 steps; (2) adding Buddleja, Nicodemia, 
and Myoporum, 238 trees—8 steps. An additional 
analysis was performed, in which the rbcL se- 
quences for Byblis, Pinguicula, and Utricularia 
(representing Lentibulariaceae) were retrieved from 
GenBank and added to the 32 sequences used 
elsewhere in this study. The results (not shown) 
indicate that the Lentibulariaceae represent a dis- 
tinct lineage, not related to either of the two scroph 
clades identified in the analyses described above, 
confirming previously published results from a more 
limited sampling (Olmstead et al., 1993a). 

The analysis of ndhF sequences yielded two 
trees (length = 2683, CI = 0.455, excluding au- 
tapomorphies). The representatives of the Scroph- 
ulariaceae form the same two distinct groups in 
the strict consensus tree (Fig. 3) as were found in 
the analysis of rbcL sequences. The closest groups 
to “‘scroph I” in order of increasing distance are 
Myoporaceae, Bignoniaceae, Acanthaceae, Ver- 
benaceae, and Labiatae. “‘Scroph II" is sister group 
to the group just described. Schlegelia forms a 
clade with the Verbenaceae and Paulownia is sister 
group to the Labiatae. The constrained search with 
a strict definition of Scrophulariaceae yielded 4 
trees 74 steps longer than MP. The more relaxed 
constraint (including Plantago, Callitriche, and 
Hippuris) yielded 10 trees 23 steps longer than 
MP. Adding additional taxa gave results as follows: 
(1) adding Buddleja and Nicodemia, 8 trees—8 
steps; (2) adding Buddleja, Nicodemia, and My- 
oporum, 18 trees—4 steps. 

The analysis of the combined rbcL and ndhF 
sequences yielded two trees (length = 3555, CI = 
0.445 excluding autapomorphies). The same two 
distinct clades of Scrophulariaceae were obtained, 
and the branching order in the strict consensus 
tree (Fig. 4) is identical with respect to family-level 
groups to the results of the ndAF sequences. Re- 
sults of the bootstrap analysis are shown in Figure 
4. The constrained search with a strict definition 
of Scrophulariaceae yielded 3 trees 93 steps longer 


than MP. The more relaxed constraint (including 
Plantago, Callitriche, and Hippuris) yielded 5 
trees 27 steps longer than MP. Adding additional 
taxa gave results as follows: (1) adding Buddleja 
and Nicodemia, 5 trees—9 steps; (2) adding Bud- 
dleja, Nicodemia, and Myoporum, 19 trees—4 
steps. A decay analysis including all trees up to 
four steps longer than MP was performed to ex- 
amine the degree of resolution remaining in the 
data, if a broadly defined Scrophulariaceae were 
to be accepted. In the strict consensus of the 1107 
trees obtained, both **scroph I” and "scroph II" 
remain, as well as clades representing all of the 





families for which more than one representative 
was included (Buddlejaceae— within ‘‘scroph I,” 
Acanthaceae, Bignoniaceae, Gesneriaceae, Labia- 
tae, Verbenaceae) and the Lamiales s.l. as a whole. 
No other resolution within the ingroup is evident. 


DISCUSSION 


Improved methods for obtaining DNA sequences 
have resulted in many systematists turning to direct 
sequence comparisons for molecular phylogenetic 
studies, rather than the indirect assessment of DNA 
sequence variation obtained using restriction site 
analysis. The initial widespread use of rbcL se- 
quences was a product of two important factors: 
(1) the size and variability of the gene are appro- 
priate for many systematic problems at the level 
of family and above and (2) primers for PCR am- 
plification and sequencing were readily available to 
the plant systematics community. However, many 
of the problems of interest to plant systematists 
are at a level of phylogenetic divergence below 
which rbcL can provide sufficient information, or 
else represent problems of resolution among closely 
spaced branch points for which having more cla- 
distic characters is desirable. Efforts to respond to 
these needs have resulted in the development of 
several alternative genes or regions of DNA for 
molecular phylogenetics (e.g., Baldwin, 1992; 
Johnson & Soltis, 1995, this issue; Olmstead & 
Sweere, 1994; Steele & Vilgalys, 1994), most of 
which have average substitution rates that are high- 
er than that for rbcL (see Hoot et al., 1995, this 
issue, for an exception). For many studies at the 
infra- or interfamilial level, rbcL may still play a 
useful role in combination with another gene to 
increase the number of characters available (Olm- 
stead & Sweere, 1994; Hoot et al., 1995, this 
issue; Johnson & Soltis, 1995; Soltis et al., 1993). 
In this study, the numbers of informative char- 
acters obtained from two genes, rbcL and ndAF, 
expressed in terms of characters per base pair of 
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FiGURE 3. One of the two shortest trees resulting from parsimony analysis of ndhF sequences (length = 2683) 
with branch lengths indicated. Arrow indicates internode that collapses in strict consensus of the two trees. 
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useful aligned sequence, are 0.13 and 0.26 re- 
spectively, and the amount of sequence obtained 
was 50% greater for ndhF than for rbcL. Thus 
ndhF yielded three times as many characters as 
did rbcL. The problem of lineage identification 
(e.g., family-level clades) in the Lamiales s.l. and 
the relationship among them has been identified as 
one for which more data than are available from 
rbcL may be needed for resolution (Olmstead et 
al., 1993a). 

ndhF sequences afford more than simply an 
additional source of nucleotide substitutions for 
phylogenetic inference. Indels are a regular, if in- 
frequent, occurrence in the middle of the coding 
sequence of ndAF. This is unlike rbcL, for which 
insertions and deletions are unknown, except at 
the very end of the coding sequence where it is 
often difficult to discriminate insertions from point 
mutations in the termination codon resulting in an 
extension of the 3' end. This feature of the gene 
may yield powerful phylogenetic characters that 
may be useful in many studies of infrafamilial phy- 
logeny as has been demonstrated for matK (John- 
son & Soltis, 1995). 

The alignment of protein-coding sequences con- 
taining indels is simplified, relative to noncoding 
DNA or rRNA coding genes, by the constraint of 
the genetic code for protein-coding genes. Align- 
ments *'by eye" are easier by virtue of including 
three nucleotides in the alignment frame. A group 
of relatively closely related species is unlikely to 
be so divergent that alignment matches will be 
obscured. In contrast, in noncoding or rRNA se- 
quences indels may be of any length, and the 
problem of aligning one-base or two-base gaps is 
much greater. In the event that divergence of 
protein coding DNA sequences is great enough that 
alignment **by eye" is difficult, aligning the trans- 
lated amino acid sequences may often resolve the 
placement of gaps that are not apparent in the 
DNA sequences. 

The 25 alignment gaps in the set of ndAF 
sequences used in this study represent 29 evolu- 
tionary events as compared to the 2683 nucleotide 
substitution events (as inferred from the results of 
the cladistic analysis of the nucleotide information). 
The overall consistency index for nucleotide sub- 
stitutions (including autapomorphies) is 0.56, 
whereas the same estimate for indels is 0.86. In 
this analysis, most of the indels were uninformative; 
however, each of the autapomorphic indels may 
be informative at a lower taxonomic level, as was 
found in several instances with matK in the Sax- 
ifragaceae s.s. and Polemoniaceae (Johnson & Sol- 
tis, 1995). In the Solanaceae (Olmstead & Sweere, 
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1994), two additional indels were documented, each 
of which was unique to one of the 17 species 
included in that study. However, the nine-bp indel 
unique to /Vicotiana in this study is common to all 
members of the Solanaceae examined and found 
in none of the other sequences yet obtained in the 
Asteridae, including those for the sister family Con- 
volvulaceae, thereby providing a synapomorphy for 
the Solanaceae. In the Acanthaceae (Scotland et 
al., 1995), 14 indels were identified, of which 8 
were unique to a single sequence and 6 were phy- 
logenetically informative. Of the six informative 
ones, only one was an evident case of parallelism, 
and one other required two events in the shortest 
tree but could be explained as a single event in a 
tree two steps longer. 

The observation of parallel occurrence of these 
infrequent events is more understandable when one 
takes into consideration the nonrandom distribution 
of indels in the sequence (Table 2, Fig. 5). Nineteen 
gaps, representing 21 inferred events, occur in a 
1 48-bp region (between positions 1425 and 157: 
in Nicotiana), and another 4 gaps (6 events) occur 
in an 18-bp region (1695-1713 in Nicotiana). 
Only two gaps, each representing a unique event, 
occur outside these regions (insertions at position 
657/658 and 1932/1933). The higher incidence 
of indels is associated with higher substitution rates 
in the 3' half of the ndhF sequence (Olmstead & 
Sweere, 1994), making that portion of the gene, 
by itself, appropriate for some phylogenetic studies 
(Catalan & Olmstead, unpublished). 

The results of three separate analyses are pre- 
sented (Figs. 2—4): rbcL sequences, ndhF sequenc- 
es, and a combined data set with both sequences. 
The case has been made for conducting separate 
analyses for data sets derived from different genes 
(e.g., Swofford, 1991) in the event that one se- 
quence exhibits strong interactions among positions 
that may bias substitutions, thereby violating the 
assumption of nonindependence among characters. 
However, no evidence of such differences has been 
found in prior analyses of rbcL and ndAF sequenc- 
es (Olmstead & Sweere, 1994; Scotland et al., 
1995), and none is anticipated here. Performing 
the separate analysis does afford the opportunity 
to examine the efficacy of each gene for phylogeny 
reconstruction and to examine the effect of par- 
titioning a data set to see whether strongly sup- 
ported clades in one partition are equally supported 
in another (Olmstead & Sweere, 1994). The results 
of the analyses of the two individual data sets for 
this group of taxa (Figs. 2-3) are congruent with 
respect to finding the same clades that are the most 
well-supported clades in the combined analysis (Fig. 
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(length = 3555) with branch lengths indicated above the internodes (the other tree is the same as Fig. 3). Numbers 
below the internodes indicate bootstrap and decay (in parentheses) values. Arrow indicates internode that collapses 
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4). The differences between the two results occur 
in areas where support is weak in both individual 
trees. Groups identified by rbcL, but not ndAF, all 
had five or fewer synapomorphies on the rbcL tree, 
and groups identified by ndAF, but not rbcL, all 
had 12 or fewer synapomorphies on the ndAF tree; 
groups in common in both results had much higher 
character support. The results of the combined 
analysis (Fig. 4) are nearly identical to the results 
of the ndAF analysis (Fig. 3), with one of the two 
most parsimonious trees found in each analysis 
being identical. However, rbcL informs the com- 
bined result in several ways: (1) ndAF does not 
fully resolve internal relationships in the **scroph 
II" clade, whereas rbcL and the combined analyses 
do. (2) ndhF and rbcL resolve the relationships 
among the three representatives of the Bignonia- 
ceae differently; the combined analysis remains 
unresolved. (3) Most importantly, at almost every 
node where the combined tree agrees with the nd AF 
result, but not the rbcL result, rbcL provides ad- 
ditional characters in support of the conclusions, 
indicating that phylogenetic signal is present in the 
"rbcL partition" for some portions of the tree 
where very few characters are informative and may 
be outweighed by chance homoplasy in that rela- 
tively small partition of the data. 


SYSTEMATIC CONCLUSIONS 


The primary systematic conclusion of this study 
is that the Scrophulariaceae, as circumscribed in 
most taxonomic treatments, are polyphyletic. The 
polyphyly of the Scrophulariaceae is complex in 
the following ways: (1) There exist two distinct 
clades in this study (and possibly more in nature) 
that contain elements of the traditional Scrophu- 
lariaceae (referred to as 
IT" in Fig. 4). (2) There are groups that are ex- 
cluded from the family in traditional classifications 


é 


‘scroph I" and "scroph 


that are clearly related to and derived from ele- 
ments of the traditional Scrophulariaceae (e.g., 
Plantaginaceae, Callitrichaceae, Hippuridaceae, and 
apparently Orobanchaceae, as noted previously, 
which is not included in this study). (3) Some dis- 
puted genera, assigned to the Scrophulariaceae in 
some treatments (e.g., Schlegelia and Paulownia 
in Thorne, 1992) do not belong to either * 
I" or “scroph II." 


'scroph 


— 
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The results of cladistic analysis of data sets de- 
rived from both rbcL and ndhF identify most of 
the same terminal lineages within the Lamiales s.l. 
Represented in these lineages are the Acanthaceae, 
Labiatae, Verbenaceae, Bignoniaceae, Gesneri- 
aceae, and two groups of taxa containing elements 
of the traditional Scrophulariaceae. The “‘scroph 
l" lineage consists, in this study, primarily of el- 
ements that are distinctive and often isolated within 
or near the traditional Scrophulariaceae: Verbas- 
cum and Celsia (Antirrhinoideae, Verbasceae), Se- 
lago (Antirrhinoideae, Selagineae), Buddleja and 
Nicodemia (Buddlejaceae), which are often placed 
close to the Scrophulariaceae, but not within it 
(e.g., Cronquist, 1981), and the type genus Scro- 
phularia (Antirrhinoideae, Scrophularieae). The 
Verbasceae, which have nearly actinomorphic flow- 
ers, were previously classified as subfamily Pseu- 
dosolaneae (Wettstein, 1895) which suggested con- 
nections to the Solanaceae. In fact, Bentham and 
Hooker (Bentham, 1876) placed the solanaceous 
tribe Salpiglossideae, which have zygomorphic flow- 
ers, unlike most Solanaceae, in the Scrophularia- 
ceae in recognition of the similarity. The Selagineae 
have been segregated from the Scrophulariaceae 
(e.g., Bentham & Hooker in Bentham, 1876) into 
their own family primarily on the basis of the re- 
duction in ovule number to one per carpel (Thieret, 
1967). Reduction in ovule number to one or two 
per carpel, at times with each carpel subdivided 
into two uniovulate mericarps, has occurred in- 
dependently in the Lamiales s.l. on several occa- 
sions (Wagstaff & Olmstead, unpublished). The 
Buddlejaceae are provisionally included in the 
"scroph I” clade, rather than as sister group to it 
(as suggested by the ndhF and combined results), 
because the rbcL results include them within the 
otherwise completely scroph group and the boot- 
strap and decay values determined for the com- 
bined data set do not unequivocally reject that 
hypothesis. Recognition of the fact that the genus 
Scrophularia belongs in this group means that a 
phylogenetic classification would have to recognize 
"scroph I’ or some portion of it as Scrophulari- 
aceae, and “‘scroph II" would require a new family 
name. 

The ‘‘scroph IT" lineage consists of the tradi- 
tional scrophs Antirrhinum (Antirrhinoideae, An- 
tirrhineae), Digitalis (Rhinanthoideae, Digitaleae), 


in strict consensus of the two trees. Dark bars indicate a deletion (within “‘scroph II") and an insertion (at the base 
of the Lamiales s.l.) in the ndhF sequences. Question marks (?) next to Paulownia and Schlegelia indicate that their 


classification into a higher order taxon remains uncertain. 
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and Veronica (Rhinanthoideae, Veroniceae), and 
representatives of the derived families Callitricha- 
ceae, Hippuridaceae, and Plantaginaceae. Thieret 
(1967) suggested that the Scrophulariaceae be di- 
vided into two subfamilies with the Antirrhinoideae 
ancestral to the Rhinanthoideae. In this analysis, 
members of the Antirrhinoideae are included in 
both “‘scroph I’ and “‘scroph IT" lineages, making 
the subfamily polyphyletic, whereas the sampled 
members of the Rhinanthoideae form a monophy- 
letic group with Plantago. The Rhinanthoideae 
contain all of the hemiparasitic Scrophulariaceae, 
from which the Orobanchaceae are evidently de- 
rived (C. dePamphilis, pers. comm.; A. Colwell, 
pers. comm.), thereby suggesting that Orobanche 
and relatives belong in this clade. The Plantagi- 
naceae exhibit modifications to floral morphology 
associated with wind pollination, which have re- 
sulted in their segregation as a separate family, 
while retaining enough similarity to be recognized 
as close to or derived from the Scrophulariaceae. 
However, the aquatic families Callitrichaceae and 
Hippuridaceae exhibit extremely modified repro- 
ductive morphology, which has baffled previous 
efforts to classify them with their closest relatives 
in the Scrophulariaceae. The third aquatic family 
in Cronquist’s (1981) Callitrichales, the Hydro- 
stachyaceae, has been found to be unrelated to the 
Callitrichaceae and Hippuridaceae and is most 
closely related to the Hydrangeaceae in the Cor- 
nales on the basis of rbcL sequences (Hempel et 
al., in press). 

The phylogenetic position and classification of 
Schlegelia and Paulownia remain unclear. rbcL 
and ndAF sequences each suggest a different place- 
ment for Schlegelia (near Myoporaceae—rbcL; 
near Verbenaceae—ndhF) and Paulownia (iso- 
lated near the base of the order or with the Big- 
noniaceae—rbcL; near Labiatae—ndhF). The 
combined analysis provides relatively weak evi- 
dence for the placement of Schlegelia near the 
Verbenaceae (33% bootstrap value) and Paulow- 
nia near the Labiatae (50% bootstrap value). The 
evidence does suggest that neither genus belongs 
with either of the lineages containing other mem- 
bers of the Scrophulariaceae. Schlegelia, and the 


— 


FIGURE 5. 
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related genera Gibsoniothamnus and Synapsis, 
may represent one of several small lineages in the 
Lamiales s.l. that have been "shoehorned" into 
families representing larger lineages in the past. 
The Lentibulariaceae appear to represent another 
small independent lineage (Olmstead et al., 19932) 
that has been suggested incorrectly to be a deriv- 
ative of the Scrophulariaceae (e.g., Cronquist, 
1981). An expanded analysis of rbcL sequences, 
including three representatives of the Lentibular- 
iaceae in addition to the 32 taxa in this study, 
indicates that the Lentibulariaceae are not related 
to either of the scroph lineages identified here (re- 
sults not shown). 

Given that the phylogenetic hypothesis derived 
from the parsimony analysis of the cpDNA se- 
quences presented here may not be the true phy- 
logeny of the group, it is instructive to examine 
prospective circumscriptions of groups that may 
represent a monophyletic Scrophulariaceae. This 
was done by imposing on the parsimony analysis 
constraints that require a group to be monophy- 
letic, while finding the most parsimonious trees. 
Several prospective circumscriptions were exam- 
ined, beginning with a very strict definition match- 
ing most recent classifications and expanding to a 
broad circumscription that includes all taxa be- 
longing to the lineages “‘scroph I" and “‘scroph II" 
and Myoporum. Not surprisingly, parsimony anal- 
yses constrained to the narrowest circumscriptions 
resulted in trees that were much longer than max- 
imum parsimony (e.g., 93 steps longer for the most 
conservative definition of Scrophulariaceae in the 
combined analysis of rbcL and ndAF sequences). 
However, a broadly circumscribed Scrophularia- 
ceae, including the Plantaginaceae, Callitricha- 
ceae, Hippuridaceae, Buddlejaceae, and Myopo- 
rum are monophyletic in trees four steps longer 
than maximum parsimony in the combined anal- 
ysis. 

If the results of a parsimony analysis were to 
be used as evidence for classification in which a 
group that requires trees four steps longer than 
maximum parsimony might be acceptable, then it 
is relevant to ask how many trees are found and 
how much resolution is retained when all trees up 


Histograms representing the changes by nucleotide position inferred over the shortest estimated tree 


for the combined data (tree in Fig. 4) for the genes ndhF and rbcL. The nucleotide position numbers refer to the 
aligned sequences used in the phylogenetic analysis beginning with the first nucleotide after the 5' PCR primer and 
ending with the last nucleotide before the 3' PCR primer. Arrows indicate locations of the PCR and sequencing 
primers. The bars under the ndAF sequence indicate the locations of gaps in the sequences (numbers refer to Table 2). 
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to four steps longer than the shortest are examined. 
A decay analysis saving all trees up to four steps 
longer than maximum parsimony found 1107 trees. 
The consensus among those trees shows that all 
families for which more than one representative 
were included are supported and that both “‘scroph 
I" and “‘scroph IT" lineages are supported, but no 
other resolution is possible within the Lamiales s.l. 
Whereas there is no assurance that the shortest 
tree is the true phylogeny for this group of taxa, 
the fact that the “‘scroph I” and **scroph IT" lin- 
eages are obtained even in the consensus of the 
trees up to four steps longer suggests that their 
classification as two distinct groups comparable to 
other groups resolved at this level (i.e., family) 
would be a conservative move and would produce 
a more stable classification when further evidence 
is obtained. 

The debate has subsided, but no consensus has 
been reached on the distinction between polyphyly 
and paraphyly (or even monophyly— see Cron- 
quist, 1987). Even Hennig used different criteria 
for the terms, first defining polyphyly on the basis 
of shared, convergent characters and paraphyly 
on the basis of shared, ancestral characters (Hen- 
nig, 1966), then (Hennig, 1975) distinguishing the 
two by whether the common ancestral (stem) spe- 
cies of a group would be included (paraphyly) or 
not included (polyphyly) in that group. The point 
at which the debate impacts systematics and clas- 
sification most strongly centers on whether a great- 
er distinction needs to be made between mono- 
phyletic versus para- and polyphyletic groups (Don- 
oghue & Cantino, 1988) or mono- and paraphyletic 
versus polyphyletic groups (Cronquist, 1987). This 
study serves a useful purpose by illuminating how 
difficult and pointless it is to try to distinguish 
between paraphyly and polyphyly as opposed to 
monophyly. Perhaps a group that is defined on the 
basis of plesiomorphic characters or by the inclu- 
sion of a stem species could be recognized and 
named Scrophulariaceae, but such a classification 
would serve poorly any evolutionary applications, 
because virtually all of the Lamiales s.l. could be 
included in an evolutionary lineage identified by 
those traits or derived from that stem species. The 
fact that progress is being made to identify and 
circumscribe monophyletic families successfully in 
the Lamiales s.l. argues against the continued rec- 
ognition of a traditional Scrophulariaceae and for 
the recognition of family-level groups that were 
once part of the Scrophulariaceae. The circum- 
scription and formal recognition of such groups 
awaits the completion of work in progress (C. 


dePamphilis, R. Olmstead & A. Wolfe, unpub- 


lished). It is hoped that the recognition of this 
division within the Scrophulariaceae based on 
cpDNA will encourage additional morphological and 
anatomical work aimed at identifying monophyletic 
groups containing elements of the traditional 
Scrophulariaceae. 
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